WO 2005/092917 PCT/CA2004/000443 

LIBRARY OF TOXTN MUTANTS. AND METHODS O F TTS ESTG SAME 
Background of the Invention 

This application relates to libraries of toxin mutants, and to methods of using same in 
' the development of therapeutics targeted against specific cell type;s. 

■ * 

Plant and bacterial toxins have a structural organization with two or more polypeptide 
domains or subunits responsible for distinct functions, referred to is A and B, The toxins may- 
be referred to as AB X toxins where x represents the number of identical or homologous B 
subunits in the toxin. This family of framework-related toxins includes examples such as 
Shiga and Shiga-tike toxins, the E. coli heat-labile enterotoxins, cholera toxin, diphtheria 

* 

toxin, pertussis toxin, Pseudomonas aeruginosa exotoxin A (Olsnes, S. and Sandvik, K. 

* 

(198S) in Lnmunotoxins pp. 39-73, Kluwer Academic, Boston; Saadvik, K. 8 Dubinina, E., 
Garred, O., et al. (1992) Biochem. Soc. Trans. 20:724) as well as plant toxins such as ricin 
and abrin. In some cases the toxin are heteromeric, in that the B chains are actually separate 
entities that connect to the toxic A chain via a non-covalent bondir.g. In other cases, the toxin 
is monomelic,, since the B chain is part of the same protein when the toxin is produced in 
nature. 

Based on their ability to block protein synthesis, proteins s\;.ch as Shiga. and Shiga-like 

* 

toxins as. well as ricin, abrin, gelonin, crotin, pokeweed antiviral protein, saporin, momordin, 

■ 

modeccin, sarcin, diphtheria toxin and exotoxin A have been referred to as 
ribosome-inactivating proteins (RIP)- The potency of RIPs is exceedingly high; one molecule 
of diphtheria toxin A chain (Yamaizumi, et al. (1978) Cell 15:245-250) or ricin A chain 
(Eiklid, et ah (1980) Exp, Cell Res/ 126:321-326) having been sho%vn to be sufficient to kill a 
eukaryotic cell. 

* 

International Patent Publication No. y/O99/401 85 describes libraries of mutant toxins 
in which mutations are introduced into the binding domain to alter the type of cells to which 
the toxic species are delivered. The new proteins are derived by mutating a binding subunit 
of the wild type heleromeric protein cytotoxic protein to create a library of microorganism 
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clones producing mutant proteins, which are then screened for the ability to specifically bind 

to and kil 1 a target cell type. 

• ■ 

US Patent No. 5,552,144 discloses a Shigella-like toxin II variant to which a mutation 
is introduced into the A chain at position 167 to change the amino acid at this position to one 
with a different charge. This resulted in a toxin with less of the enzymatic activity associated 
with toxicity. 

US Patent No. 6,593,132 describes recombinant toxic proteins which are specifically 
toxic to diseased cells but do not depend for their specificity of action on a specific cell 
binding component. The recombinant proteins of the '132 patent have, an A chain of a 

i 

ricin-like toxin linked to a B chain by a synthetic linker sequence which may be cleaved 
specifically by a protease localized in cells or tissues affected by a specific disease to liberate 
the toxic A chain thereby selectively inhibiting or destroying the diseased cells or tissues. 

< * 

US Patent No. 6,649,742 disclosed Type I ribosome-inactivating proteins (RIPs) and 

« 

' analogs of the RIPs having a cysteine available for disulfide bonding to targeting molecules. 
The RIPs and RIP analogs are used as components of cytotoxic therapeutic agents to 

* ■ 

selectively eliminate any cell type to which the RIP component is t argeted by the specific 
binding capacity of the second component of the agent. 

■ 

Summary of the Invention 

* 

The present provides combinatorial protein libraries comprising a plurality of protein 
species, in which each protein species comprising an A chain of a fc.eter&meric toxic protein 
into which an insert has been introduced. In accordance with the invention, the insert is a 
polypeptide of varying amino acid sequence having a length of 2 or more amino acid 
residues, for example from 3 to 200 amino acid residues; amd the insert is introduced into the 
protease-sensitive loop of the A chain sequence. The result of the :ntroduction of the insert 
creates an artificial binding domain within the A chain,; such that th-5 A chain develops toxic 
specificity which is independent of and different from the normal specificity associated with 
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the B chains binding domains. Screening of the library allows thci selection and identification 
of mutant toxins that are specific for different cell types, including cancer cell types. In one 
embodiment of the invention, the combinatorial library comprises protein species that are 
formed by introducing the insert into a Shiga-like toxin I A chain, for example in the region 
between amino acids 242 and 261 , as defined with reference to Seq. ID No. L 

The invention also provides a combinatorial expression library comprising a plurality 
of species of expression systems, each species expressing a protein species comprising an A 
chain of a heteromeric toxic protein into which the insert as described above has been 
introduced. Expression of protein from the combinatorial expression library results in the 
fonnation of a combinatorial protein library. 

In a further aspect, the invention provides a composition for treating melanoma, and 
methods for using such a composition. The composition comprises a protein species 
comprising an A chain of a heteromeric toxic protein into which a polypeptide having a 
length of 2 or more amino acid residues, . for example from 3 to 200 amino acid residues, is 
introduced into the protease-sensitive loop of the A chain sequence. The insert is selected 
such that protein species has toxic activity towards melanoma cells . The protein species is 
used in the treatment of melanoma by administering it to a patient diagnosed with melanoma, 
in an amount sufficient to result in the reduction of the number of living melanoma cells, 

In a further aspect^ the invention provides a composition for treating other types of 
cancer. The composition comprises a protein species comprising an A chain of a heteromeric 
toxic protein into which a polypeptide having a length of 2 or more amino acid residues, for 
example from 3 to 200 amino acid residues, is introduced into the protease-sensitive loop of 
the A chain sequence. The insert is selected such that protein species has toxic activity 
towards the cancer cells. In a specific embodiment, the insert is selected to bind to MUC-1 .. 
receptors. The protein species is used in the treatment of melanoma by administering it to a 
patient diagnosed with melanoma, in an amount sufficient to result in the reduction of the 
number of living melanoma cells. 
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* 

In a further aspect, the invention provides a method for identifying ligands that bind to 
specific targets/receptors, such as tumor markers knovfti to exist on cancer cells. In this 
method, the toxin of the combinatorial library serves as a reporter, and a combinatorial 
protein library in accordance with the invention is screened against cells known to possess the 

target/receptor. Proteins which are shown to be toxic to the cells zre evaluated to determine 

• ■ 

the sequence of the inserted region. Peptides of this sequence can then be used, in 
combination with a toxin or other molecules, to direct compounds to cells possessing the 
target/receptor. 

In yet a further aspect of the invention, a method is provided for identifying toxic 
substances specific for a known cell marker. In this embodiment of the invention, the toxin 
need not serve as reporter. Thus, cells having the marker, or an isolated target/receptor, 
where available, are exposed to the combinatorial protein library, [n preferred embodiments, 
the cells or the isolated target/receptor are immobilized on a solid support, such as in plastic 
wells. Captured proteins from the library are then rescreened against cells to confirm their 
toxicity and specificity for cells expressing the target/receptor, and their suitability for use as 
a therapeutic. . 

* 

Brief Description of the Drawings 

* 

Fig. 1 is a schematic diagram depicting the Al and A2 domains of SLT-1. The A 
chain is composed of 293 amino acids. The chain is cleaved by fuxin to produce an Al 
catalytic fragment and an A2 C-terminai tail non-covalently associated with the B pentamer, 
A protease-sensitive loop (stripped area) is defined by the only two cysteine residues in the A 

i 

chain (Cys 242 and 261). Tyr77, Glul67, Argl70 and Trp203 represent residues crucial for 
the catalytic activity of the Al domain (arrows). 

Fig. 2 A is a schematic representation of the SLT-1 A chain ^1-293) with the breast 
cancer-associated MUC1 epitope PDTRPAP (control sequence recognized by the mAb One 
M27) inserted between residues 245 and 246 and a 6-Histidine tag at its N-temiinus. 
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Fig. 2 B is a depiction of out SLT-1 A chain-tripeptide library construction where the 
three key positions of the MUC1 epitope recognized by the mAb One M27 were randomized 
(XXX region). The tripeptide library was inserted in a naturally occurring loop region of the 

A chain created by the presence of a disulfide bridge between Cy$ 242 and Cys 261. 

* « 

■ 

■ 

Fig. 3 shows a representative ELISA data set from screening 96 distinct single A 
chain variants from our SLT-1 A chain-tripeptide library with mAb One M27. Toxin variant # 

■ w 

41 (Tables 1 and 2) gave a strong ELISA signal and had the expected epitope. 

Fig. 4 shows a schematic diagram of a 7-amino acid random segment inserted 
between residues 245 and 246 of the A chain. 

Fig. 5 shows the results of tests on seven toxin variants that, were identified as 
repeatable killers of the human melanoma cell line 518A2, The absicissa represents the log 
concentration of toxin used to treat the cells and the ordinate depicts the observed percentage 

* 

of cells that are viable after 48 hours. The closed triangles depicts the effect of the wild type 
toxin on 518A2 cells while the two most efficacious A chain variants were termed SAM#3 
(open squares) and SAM#5 (X symbols). 

Detailed Description of the Invention - 

The present invention relates to a combinatorial protein libiary comprising a plurality 
of protein species, each protein species comprising an A chain of a heteromeric toxic protein 
into which an insert has been introduced. The insert is a polypeptide of varying amino acid 
sequence having a length of 2 or more amino acid residues, for example from 3 to 200 amino 
acid residues; and is introduced into the protease-sensitive loop of the A chain sequence. 
The library provides a collection of protein species that can be screened for individual 
proteins that are toxic against specific cell types, such as specific czneer cell types. 
Individual protein species thus selected are suitably used in the treatment of the cancer. 
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As used in the specification and claims of this application, the term "combinatorial 
library" refers to a mixture of species each of which has a comrno] i portion and a variable 
portion. In the case of a "combinatorial protein library" each of trie species is a protein or 
peptide, and the common portions and the variable portions are each amino acid sequences. 
In the case of a combinatorial expression library, the species are microorganisms, expression 
vectors or polynucleotides which, when expressed, produce proteins or peptides having 
common portions and variable portions. In this case, the common portions and the variable 
portions are each nucleotide sequences. Since the purpose of the combinatorial library is to 
provide multiple variants for screening purposes, the combinatorial library preferably 
contains at least 100 distinct species of protein or expression unit, more preferably at least 
1000 distinct species. 

As used in the specification and claims of this application, lie term "heteromeric toxic 
protein" refers to the class of protein toxins with the common orgajiization theme of being 
heteromeric in nature with two or more polypeptide domains or subunits responsible for 

♦ 

distinct functions (Merritt, E.A., and Hoi, W.GJ. (1995) Curr. Opin. Struct. Biol. 5:165 1). In 
such proteins^ the two or more subunits or domains could be referred to as A and B, and the 
toxins as AB X toxins where x represents the number of identical or homologous B subunits in 
the toxin. This family of framework-related toxins includes examples such as Shiga and 
Shiga-like toxins, the E. coli heat-labile enterotoxins, cholera toxin, diphtheria toxin, 
pertussis toxin, Pseudomonas aeruginosa exotoxin A as well as plaat toxins such as ricin and 
abrin. Based on their ability to block protein synthesis, proteins such as Shiga and Shiga-like 
toxins as well as ricin, abrin, gelonin, crotin, pokeweed antiviral protein, saporin, momordin, 
modeccin, sarcin, diphtheria toxin and exotoxin A have been referred to as ribosome- 
inactivating proteins (RJP). In these naturally-occurring heteromeric toxic proteins, the A 
chain is the toxic portion, while the B chains form a binding moiety winch binds to a receptor 
on a cell susceptible to the toxin, thereby delivering the A chain to ihe cell. 

One specific example of the A chain of a heteromeric toxic protein is the A chain of 
SLT-1 which has the sequence given is Seq. ID No. 1. The A chain of SLT-1 comprises of - 
293 amino acids with the enzymatic (toxic) domain spanning residv.es 1 to 239. A protease 
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sensitive loop encompassing residues 242 to 261 is normally exposed, and is a suitable site 
for inserting a peptide sequence. 

SLT-1 is a type II ribosome inactivating protein produced oy pathogenic strain of 
Escherichia coli (0157:H7) (24). SLT-1 is an AB5 complex of about 70 kD (O'Brien, A. D., 
and Holmes, R. K. (1987) Shiga and Shiga-like toxins. Microbiol Rev 5 1 , 206-220.). The 
single 32 kD catalytic A subunit is non-covalently associated with a pentamer of five identical 
7.7 kD B subunits. The B subunit pentamer recognizes the glycolipid globotriaosylceramide 
(also known as CD77 or Gb3) on the surface of target cells (Lingwood, C. A. (1993) 
Verotoxins and their glycolipid receptors. Adv Lipid Res 25, 189-21 1 ; Jacewicz, et al. (1986) 
Pathogenesis of shigella diarrhea. XL Isolation of a shigella toxin- binding glycolipid from 
rabbit jejunum and HeLa cells and its identification as globotriaosylceramide. J Exp Med 
163, 1391-1404). A protease-sensitive loop located between Cys242 and 261 at the C 
terminus of the A chain is cleaved by form during cellular routing (Fig- !)• The A chain 
remains associated with its B subunit pentamer due to an hitrachain disulfide bond between 
Cys242 and Cys26l as it travels to the ER lumen (Sandvig, et al. ( 1989) Endocytosis from 
coated pits of Shiga toxin: a glycolipid-binding protein from Shigella dysenteriae 1. J Cell 
Biol 108, 1331-1343;32. Garred, et al (1995) Role of processing acid intracellular transport 
for optimal toxicity of Shiga toxin and toxin mutants. Exp Cell Rev. 218, 39-49.). The 
- disulfide bond is finally reduced in the ER lumen and the Al chain (first 25 1 aa) is released 
' and subsequently retrotranslocated to the cytosol where it inactivates ribosomes (O'Brien, et 
al. (1992) Shiga toxin: biochemistry, genetics, mode of action, and role in pathogenesis. Curr 
Top Microbiol Immunol 180, 65-94.). More specifically, the A chain of SLT-1 is a 
N-glycosidase that catalytically cleaves a specific adenine nucleotide (4324) from the 28 S 
rRNA (Brigotti, et al. (1997) The KNA-N-glycosidase activity of S niga-like toxin I: kinetic 
parameters of the native and activated toxin. Toxicon 35, 1431-1437.). This event leads to 
the inhibition of protein synthesis by preventing the binding of amiioacyl tRNAs to the 
ribosome and halting protein elongation. Mutagenesis studies as well as structural analysis 
performed on the A chains of ST and ricin have delineated key conserved residues involved 
in catalytic activity (Deresiewicz, et al.(1992) Mutations affecting the activity of the 
Shiga-like toxin I A-chain. Biochemistry 31, 3272-3280; Ready, et al. (1991) Site-directed 
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mutagenesis of ricin A-chain and implications for the mechanism of action. Proteins 10, 
270-278). Residues crucial for catalytic activity of SLT-1 are tyrosine 77, glutamic acid 167, 
arginine 170 and tryptophan 203 (Hovde, et al. (1988) Evidence tliat glutamic acid 167 is an 

t 

active-site residue of Shiga-like toxin I. Proc Natl Acad Sci USA 85, 2568-2572; Yamasaki, 



70 of the A subunit of Vero toxin 1 produced 
activity. Microb Pathog 11, 1-9). In 



et al. (1991) Importance of arginine at position : 
by e-nterohemorrhagic Escherichia coli for toxin 
addition, binding of the toxin to the cell surface is critical to introduction into the cell and 
thus for toxic activity. Because of this the A chain alone is not significantly toxic. 



In addition to the A chain of SLT-1, other toxins may also be used to form libraries 
and compositions in accordance with the invention, and may be used in the methods of the 
invention. Specifically, Shiga and other Shiga-like toxins, as well, as ricin, abrin, gelonin, 
crotin, pokeweed antiviral protein, saporin, momordin, modeccin, sarcin, diphtheria toxin and 
exotoxin A, and other functionally-related ribosome inactivating p roteins (RIP) may be used. 

* • 

For purposes of making the combinatorial library of the invention, short amino acid 
sequences of at least 2 amino acids, for example of 3 to 200 amino acids residues in length, 
are inserted into this protease sensitive loop. The number of amino acids in the insert defines 
the number of possible random variants that can be in the library. !ror example, when the 
number of amino acids in the insert is 3, the maximum number of variants is 20 3 or 8000 
variants. Larger inserts provide corresponding larger numbers of possible variants. 

As an alternative to using inserts with purely random sequences, inserts can be 
designed based on a known template. For example, as described below in the context of 

Muc-1, variations in a sequence known to provide receptor binding properties for a particular 

* ■ ■ 

cell type can be used to identify an insert that provides for optimization of the toxic properties 

* 

of the protein construct. This same optimization may be performed, on an individual 
sequence isolated by screening of a larger combinatorial library. It will be appreciated, 
however, that the insert in the proof of principle tests of Example 1 use an insert which is the 
target/receptor, while in the actual case the insert would be based on the sequence of a known 
ligand to be optimized for maximum effectiveness and specificity. 
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This approach, in which a specific, known receptor type is targeted illustrates a further 
aspect of the invention, namely a method for identifying peptide ligands that bind to specific 
targets/receptors! such as tumor markers known to exist on cancer cells. In this method, a 
combinatorial protein library in accordance with the invention is screened against cells known 
to possess the target/receptor. The toxin serves as the reporter, such that proteins which are 
shown to be toxic to the cells are evaluated to determine the sequence of the inserted region. 
Peptides of this sequence can then be used, in combination with a toxin or other molecules, to 
direct compounds to cells possessing the target/receptor. Further resting peptides of the 
sequence of the inserted region may be appropriate to confirm thai: they are a ligand for the 
specific target/receptor, as opposed to some other receptor on the cell type. This can be done 
using binding assays with isolated receptor, where such are available. 

■ 

The invention also provides a method for identifying toxic substances specific for a 
known cell marker, and particularly markers that are available in isolated form. In this 
embodiment of the invention, the toxin need not serve as reporter. Thus, cells having the 
marker, or an isolated target/receptor, where available, are exposed to the combinatorial 
protein library. In preferred embodiments, the cells or the isolated target/receptor are 
immobilized on a solid support, such as in plastic wells. Captured proteins from the library 
are then rescreened against cells to confirm their toxicity and specificity for cells expressing 
the target/receptor, and their suitability for use as a therapeutic. Ttiis method can be used to 
identify toxins with binding inserts specific for any tumor marker or cell receptor, including 
without limitation tumor markers such as mucins such as MUC-1 «nd its glycoforms, Her-2, 
Her2-Neu, tyrosine kinase markers, EGFR, GD2, and GD3. 

Thus, in accordance with this specific aspect of the invention, a method for isolating 
a toxin specific for a known target/receptor is provided that comprises the steps of: 

(a) exposing the target/receptor to a combinatorial protein library comprising a 
plurality of protein species, each protein species comprising an A c.'iain of a toxic protein int. 
which an insert has been introduced, wherein, 

the insert is a polypeptide of varying amino acid sequence having a length of ; 

least 2 amino acid residues; and 
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* 

the insert is introduced into the pro tease-sensitive loop of the A chain 

sequence; and 

• .■ 

(b) isolating at least one protein species from the combinatorial protein library 
captured by binding to the target/receptor. As used in the specification and claims hereof, 
the term "isolating" refers to any mechanism for obtaining a composition containing the 
protein separated from the milieu in which it is expressed in a form suitable for further 
analysis. This would include release from the target/receptor following capture (for example 
by exposure to a competitive binding agent) or isolation from a culture of a clone expressing 
the protein found to be captured. The method may further compri ses the step of screening 

the isolated protein against cells expressing the target/receptor, to confirm their toxicity for 

i 

cells expressing the target/receptor. Procedures suitable for this screening are described in 
Example 3. As noted above, in this method, the target/receptor may be a purified 

■ 

target/receptor and may be immobilized on ja solid support. The target/receptor may also be 
on the surface of cells, which may be immobilized. Where the tai get/receptor is on the 
surface of cells, the toxin can serve as a reporter, and the death of ihe cells is indicative, of 
receptor binding. 

* ' > * 

♦ * • 

Our experience with constructing SLT-1 libraries has pointed out a number of 
practical issues that are appropriately considered in selecting a protein template for building 
combinatorial libraries. An important factor is to choose a single chain protein, preferably a : 
bacterial protein of less than 300 amino acids; (if the libraries are to be expressed in 
prokaryotes). Use of a smaller toxin both increases its potential to penetrate into solid tumors 
and reduces its immunogenicity. Secondly, the protein template should spontaneously fold in 

• * 

solution into its active form such that th$re is minimal need for hos t's chaperones. One 
should avoid for example scaffolds that contain multiple cysteine residues normally involved 
in disulfide bridges. In addition, a single chain protein as opposed to a multi-subunit complex 
may be more easily exported from bacteria. Thirdly, the protein template should possess an 
enzymatic activity, which can rapidly be measured to confirm the proper folding of peptide 

* 

variants containing single and multiple site-directed mutations. Fooxthly, a simple screening 
approach should be incorporated into the design of combinatorial libraries. Such searches 
should be amenable to high-throughput screening methods. The catalytic A chain of SLT-1 
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(residues 1 to 293) meets these criteria because it is a single chain that lacks any known 
receptor binding function, and that has a well defined structure and catalytic site. 

ft 

A further aspect of the invention is a combinatorial expression library comprising a 

plurality of species of expression systems. Each species within the expression library 

* ' ' • 

expresses a protein species comprising an A chs^n of a heieromeric toxic protein into which 

an insert has been introduced. The insert is a polypeptide of varyiag amino acid sequence 

* ■ * 

having a length of 2 or more amino acid residues, for example from 3 to 200 amino acid 
residues; and is introduced into the protease-sensitive loop of the A chain sequence, Suitable 
expression systems include plasmids and viral vectors. 

The invention will now be further described with reference to the following, non- 
limiting examples. 

Example 1 

Proof-of-concept: design and mining of a prototypic A chain-tripeptide library 

We originally created a simple tripeptide library inserted in the C-terminal 
protease-sensitive loop of the SLT-1 A chain (Figij. 2A and B). His A chain loop region is 

■ 

naturally constrained due to the presence of a single disulfide bond bridging Cys242 to 
Cys26 1 . The maximal diversity of this library can thus be calculated to be 20 3 or 8000 
permutations' of a tripeptide sequence. As a proof-6f- concept that A chain libraries can easily 
be screened for a new receptor-binding activity, we picked more than 3000 colonies from this 

A chain-tripeptide library and purified the mutant toxin produced ty each clone. We noticed 

I I 

very early in this study that the level of expression ;of A chain mutsnt was dramatically 
increased when expressed in the presence of the wild-type SLT-1 is subunit. Thus the mutant 

forms of A chain were expressed and initially purifiedj as AB5 toxi?i variants. Since all A 

I " • ■ 

subunits harbor a polyHis purification tag, it is relatively easy to remove the B subunit with 

denaturants (urea for example) while recovering the A chain on metal-affinity columns or 

beads. Western blots performed on randomly selected bacterial clones indicated that > 70% of 

these colonies produced significant amounts of these A chain mutants. 



- 11 - 



WO 2005/092917 



PCT/CA2004/000443 



These toxin variants were then coated in individual wells of 96-weli plates and 
screened by ELISA for their ability to bind the monoclonal antibody One M27 (Linsley, et 
al.(1988) Monoclonal antibodies reactive with mucin glycoproteins found in sera from breast 
cancer patients. Cancer Res 48, 2138-2148.), directed at the well-characterized breast cancer 
tripeptide epitope Thr-Arg-Pro of the human MUC1 tandem repeat (Gendler, et al. (1988) A 
highly immunogenic region of a human polymorphic epithelial mucin expressed by 
carcinomas is made up of tandem repeats. J Biol Chem 263, 12820-12823; Girling, et al. 
(1989) A core protein epitope of the polymorphic epithelial mucin detected by the 
monoclonal antibody SM-3 is selectively exposed in a range of primary carcinomas. Int J 
Cancer 43, 1072-1076). As shown in Tables 1 and 2, moss A chain mutants neither coded for 
the tripeptide insert that matched the targeted epitope of One M27 nor were recognized by the 
antibody. However on two occasions, a toxin variant harboring ths exact epitope (Table 1, 
Fig.3) gave a strong ELISA signal comparable to the one observed with our control A chain 
harboring the MUC1 tripeptide epitope and had the expected epitope sequence in its 
randomized tripeptide region. A typical ELISA data set for 96 A chain mutants is presented in 
Fig. 3, highlighting the fact that the majority of A chain mutants did not recognized the mAb 
One M27 except for an A chain variant (mutant #41 in Tables 1 and 2). These results clearly 

established tliat A chain libraries can easily be constructed and screened to find toxin variants 

1 

able to specifically target a given receptor, in this case an antigen-combining site, 

* ■ 

Example 2 

I 

Making of Combinatorial SLT-1 A-heptapeptide Library 



Library diversity represents a crucial parameter in screening combinatorial libraries 
for ligands able to bind specifically and with high affinity to a particular target. The SLT-1 A 
chain-tripeptide library described in Example 1 has a maximal diversity of 20 3 or 8000 
possible mutated A chains. This sampling repertoire is small but v*as useful in mapping a 
tripeptide epitope to establish our proof-of-concept. A seven-residue library (20 7 or 1,3 x 10 9 
possible mutants) represents a more typical minimal diversity level commonly used in 



designing phage display as well as synthetic peptide libraries. Thus, as a starting point, we 
built a SLT-1 A chain library with a 7 -amino acid long random sequence inserted in its 
C-terminus. This library was installed in the A chain's protease sensitive loop, a loop region 
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naturally constrained by a disulfide bond. This library provided sufficient diversity to insure 
that A chain toxin variants can be identified that target new or known internalized receptors 
on cancer cells. AU elements of this library-(as well as all other libraries proposed) contain a 
N-terminal His tag to quickly purify A chain mutants . The library (Fig. 4) was generated 
using a megaprimer PCR strategy (Sarkar G, and Somniers S, (1930). The 'megaprimer' 
method of site-directed mutagenesis. Biotechniques 8, 404-407). The megaprimer strategy is 
widely used to introduce mutations into target DNA sequence, by way of two rounds of PCR 
that use two flanking primers and an internal mutagenic primer. A description of the library 

* 

design for the SLT-1 A chain-heptapeptide library will serve as th-2 example for other future 
single A chain libraries. The SLT-1 A chain-heptapeptide library (Fig. 5) harbors a 7-amino 
acid random insertion between amino acid 245 and 246 of the A chain, the identical site used 
to construct our SLT-1 A chain-trip eptide library (Tables 1 , 2, Fig 3). Briefly, two flanking 
primers A (GTT ACT GTG ACA GCT GAA GCT TTA CGT TTT CG (Seq. ID No. 2) and B 
(GAG A AG AAG AGA CTG CAG ATT CCA TCT GTT G (Seq. ID No. 3)) carrying 
Hindi 1 1 and Pstl restriction sites respectively were annealed within the 5* and 3' ends of the 
SLT-1 operon. A library oligonucleotide F containing all seven random amino acid (NNS) as 
. well as a long matching sequence to anneal to the template were synthesized. In the synthesis 
of the random oligonucleotide, the relative representation of each iimino acid was improved 
by restricting the third position of each codon to G or T (Noren, K. A., and Noren, C. J. 
(2001) Construction of high-complexity combinatorial phage display peptide libraries. 
Methods 23, 169-178,). This type of restriction reduces the overall DNA sequence 
complexity as well as coding discrepancy between residues (Reidhaar-Olson, et al. (1991) 
Random mutagenesis of protein sequences using oligonucleotide cassettes. Methods Enzymo: 

• ■ 

208, 564-586. ), This strategy also minimizes the occurrence of stop codons (TAA and TGA 
while the stop codon (TAG) is suppressed by using a supE bacterial strain, which specifies 

« 

for the insertion of a Gin residue when TAG codons are translated. The first PCR reaction 

* ■ 

was performed using primers A and F and the resulting product purified by denaturing 
polyacrylamide gel electrophoresis. This product then served as the megaprimer with primer 
B for a second PCR reaction to amplify the random DNA. The final library DNA (PCR 
product) will then be digested with Hindi 1 1 and Pstl and cloned into the backbone of a 
pECHE9a expression vector (MTI, Toronto). The E. coli strain JM 1 01 was subsequently 

- 13 - 
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transformed with the resulting pECHE vector and single bacterial colonies picked, lysed and 
their supematants analyzed for the expression of single A chain toxins or layered on cancer 

♦ ■ 

cells and screened using the SRB for cell cytotoxicity assay. 

Example 3 

Mining the combinatorial SLT-1 A-heptapeptide librairy against cancer cell lines using a 
cytotoxicity assay 

We screened our SLT-1 A-heptapeptide library using the cytotoxic function of the A 
chain as a reporter signal. Cytotoxicity is a more informative property to measure than 
binding to a receptor since it implies that the toxin is internalized, processed and delivered 

* 

near ribosomes, clearly a multi-step event. The cytotoxicity assay was essentially performed 
as previously described (Bray, et ah (2001) Probing the surface of sukaiyotic ceils using 
combinatorial toxin libraries. Current Biology 11, 697-701). Brieily, the strategy to screen all 
our A chain libraries was based on the following principles;. Established cancer cell lines such 
as SK-BR-3 (human breast), CAMA-1 (human breast), 518A2 (huoian melanoma), PC3 
(human prostate) and B 16 (murine melanoma) were grown in 96-well plates and used as 
targets in the primary screen stages. These cell lines were initially selected for our holotoxin 
library searches (Bray, supra) based on their adherence (plastic), their cell viability staining 
properties (SRB) in a high-throughput screening setting as well as their lack of receptor and 
sensitivity to native SLT-1 (to insure a reduced level of false positives). Single bacterial 
colonies from each library were picked and (grown in 96 deep well plates, The cells were 
harvested, lysed, and their lysates clarified. Since all expressed SLT-1 A chain variants have 
a 6 histidine tag at their N-terminus, each of them was puriSed from their lysate using 
nickel-affinity beads (96-well format) and layered on target cells. The plates containing the 
target cells treated with A chain variants were then incubated at 37°C for 48 hours, followed 
by fixation and staining with Sulforhodamine B (SRB). The SRB assay is a colorimetric 
end-point assay, which quantifies viable cells by. staining their cellular protein content 
(Skehan, et al. (1990) New colorimetric cytotoxicity assay for anticancer-drug screening. J 
Natl Cancer Inst 82, 1 107-1 112.). The SRB assay has been adopted by NCI/NIH for their 
high-throughput screening of drug candidates on cancer cell lines. Viability assays were 
repeated for any bacterial extracts leading to cell death. A first round of screening was 
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performed on more than 5000 bacterial clones (equivalent to 5000 distinct A chain toxins) 
and 7 toxin variants were identified as repeatable killers of the human melanoma cell line 
518A2 (Fig. 5). The abscissa represents the log concentration of toxin used to treat the cells 
and the ordinate depicts the observed percentage of cells that are -viable after 48 hours. The 
closed triangles depicts the effect of the wild type toxin on 518A2 cells while the two most 
efficacious A chain variants were termed SAM#3 (open squares) Jind SAM#5 (X symbols). 

Seven promising A chain variants were men re-screened against a panel of cell lines 
(Vero [Monkey, normal kidney]; PC-3 [Human, prostate cancer]; HepG2 [Human, 
hepatoma]; SiHa [Human, cervical cancer]; PanC [Human, pancreatic cancer]; SKBR-3 
[Human, breast cancer]; 518-A2 [Human, melanoma]; U87 [Human, glioma]; B16-F10 
[Mouse, melanoma]; HS-216 [Human, normal fibroblast]; CAMA-1 [Human, breast cancer] 
OVCar-3 [Human, ovarian cancer]). Of these seven four were observed to observed to have 
activity against a cancer cell line, two for 518-A2 human melanoir,a, and one each for SiHa 
(human cervical cancer cells) and US7-A (human brain cancer cells; glioma). 

* 

The genes coding for the two A chain toxins (SAM3 and SAM 5) that resulted in 
toxicity of the human melanoma cell lines were sequenced to determine the amino acid 
sequences inserted between residues 245 and 246 of the wild-type A. chain. The sequences, 
including the His-tag, are listed in Seq. ID Nos 4 and 5, respectively, 
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Table 1; DNA sequences of randomly picked clones from the SLT-1 A chain-trip ep tide 
library. Mutated bases in bold characters. Mutant #41 was identified in our ELISA screen as 

* 

a strong binder of mAb One M27 (Fig. 3). 



10 



15 



20 



SLT-1 A chainNucleotide sequence 
variant 



Diversity 

(nucleotide changes in 
mutated region) 



MUCl epitopeCCA GAC ACG CGA CCA GCT CCA 0/9 




Mutant #1 


CCA GAC 


GGG ATC GGG 


GCT CCA 


8/9 


Mutant #2 


CCA GAC 


CTG GAG ATG 


GCT CCA 


8/9 


Mutant #3 


CCA GAC 


CCC CGT GGG 


GCT CCA . 


6/9 


Mutant #4 


CCA GAC 


GAC GAC TTG 


GCT CCA 


9/9 


Mutant #5 


CCA GAC 


GTC CGG TGG 


GCT CCA 


7/9 


Mutant #6 


CCA GAC 


CAG CGC TGG 


GCT CCA 


6/9 


Mutant #7 


CCA GAC 


CTC AGG ATG 


GCT CCA 


8/9 


Mutant #8 


CCA GAC 


TCC CAG GAG 


GCT CCA 


7/9 


Mutant #9 - 


CCAGAC 


TCC GAC CCC 


GCT CCA 


6/9 


Mutant #41 


CCA GAC 


ACG CGC CCC 


GCT CCA 


2/9 
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Table 2: Amino acid sequence alignment of randomly selected clones firom the SLT-1 A 
chain-tripeptide library and ELIS A signal of purified SLT-1 A chain variants detected with 
mAb (One M27) raised against the MUO epitope Thr-Arg-Pro. Mutated tripeptide region : 
bold characters. Mutant #41 was identified in our ELISA screen as a strong binder of mAb 
One M27. 



SLT-1 A chainDeduced amino acid sequence ELISA. readings 

variant (405 nm) 



MUCl epitope 


CHHHPD TRP 


APASRVAKMASDEFPSMC 


1.3 


Mutant #1 


CHHHPD GIG 


APASRVAKMASDEFPSMC 


0.08 


Mutant #2 


CHHHPD LQM APASRVAKMASDEFPSMC 


0.03 


Mutant #3 


CHHHPD PRG 


APASRVAKMASDEFPSMC 


0.03 


Mutant #4 


CHHHPD DDL 


AP ASRV ARMASDEFP SMC 


0.06 


Mutant #5 


CHHHPD VRW AP ASRV ARMASDEFP SMC 


0.07 


Mutant #6 


CHHHPD QRL, 


AP ASRV ARMASD EFP SMC 


0.06 


Mutant #7 


CHHHPD LRM 


APASRVAKMASDEFPSMC 


0.11 


Mutant #8 


CHHHPD SQE 


APASRVAKMASDEFPSMC 


. 0,13 


•-Mutant #9 


CHHHPD SDP 


APASRVAKMASDEFPSMC 


0.07 


Mutant #41 


CHHHPD TRP 


APASRVAKMASDEFPSMC 


1.25 
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